An Approach to the Decoder Complexity Reduction in Waveform Interpolation Speech Coding
نویسندگان
چکیده
Since current Text-to-Speech (TTS) synthesizers are mostly based on a technique known as synthesis by concatenation, the implementation of a high quality TTS requires huge storage space for a large number of speech segments. In order to compress the database in the TTS system, the use of speech coders would be an efficient solution. Waveform Interpolation (WI) has been shown to be an efficient speech coding algorithm to provide high quality speech at low bit rates. However, its applications are constrained due to the high computational complexities. Especially, for the application of speech compression for the TTS database, the decoder complexity is one of the major factors in order to utilize the speech coder in the TTS system. In this paper, we describe an approach to the complexity reduction in WI speech decoder, which is used for compressing the database of the TTS system. The proposed idea is able to reduce the complexity by removing the realignment process from the decoder procedure in WI. Since the realignment factor obtained in the encoder should be transmitted to the decoder in order to realign the characteristic waveforms, overall bit rate is slightly increased. We can reduce the decoder complexity by 20 percent utilizing the new approach to the realignment procedure.
منابع مشابه
Very low complexity interpolative speech coding at 1.2 to 2.4 kbps
The recently-introduced waveform interpolation (WI) coders [2, 4, 5, 6] provide good-quality speech at low rates but may be too complex for commercial use. This paper proposes new approaches to low-complexity WI speech coding at rates of 1.2 and 2.4 kbps. The proposed coders are 4 to 5 times faster than the previously reported ones . At 2.4 kbps, the complexity is about 7.5 and 2.5 MFLOPS for t...
متن کاملA Low-complexity Improved WI Speech Coding at 2kbps
The waveform interpolation (WI) speech coding presents a good performance at low bit rate. However, the algorithm has a very high complexity in computation. In this paper, a low-complexity improved waveform interpolation speech coder at 2kbps is proposed. The improved coding scheme has greatly reduced the computational complexity and improved the reconstructed speech quality by using various te...
متن کاملA Vector-Predictive Multi-Mode Matrix Quantization Approach for Parametric Speech Coding
In parametric speech coding, the accuracy of parameter quantization has a significant effect on speech quality. In this paper, we present a flexible and high-fidelity multi-mode quantization approach that combines the beneficial features of predictive vector quantization and matrix quantization. As an example, the proposed technique is employed in quantization of the power component in a wavefo...
متن کاملQuantization of Magnitude Spectra in Waveform Interpolation Speech Coding
Efficient quantization of magnitude spectra is one of the central issues in speech coding, especially with techniques operating at low bit rates. This paper presents a novel high-accuracy quantization scheme for the magnitude spectra in waveform interpolation speech coding. The performance of the proposed techniques, including both their accuracy and complexity, is studied using objective measu...
متن کاملA novel 2kb/s waveform interpolation speech coder based on non-negative matrix factorization
In this paper, a 2kb/s Waveform Interpolation speech coder is proposed based on non-negative matrix factorization (NMF). In characteristic waveforms (CWs) decomposition, band-partitioning initialization constraints were set to basis vectors before NMF was carried out. This decomposition method only requires speech signal from the current frame, and can yield high decomposition quality with low ...
متن کامل